Comparative Analysis of Mycobacterium tuberculosis pe and ppe Genes Reveals High Sequence Variation and an Apparent Absence of Selective Constraints
نویسندگان
چکیده
Mycobacterium tuberculosis complex (MTBC) genomes contain 2 large gene families termed pe and ppe. The function of pe/ppe proteins remains enigmatic but studies suggest that they are secreted or cell surface associated and are involved in bacterial virulence. Previous studies have also shown that some pe/ppe genes are polymorphic, a finding that suggests involvement in antigenic variation. Using comparative sequence analysis of 18 publicly available MTBC whole genome sequences, we have performed alignments of 33 pe (excluding pe_pgrs) and 66 ppe genes in order to detect the frequency and nature of genetic variation. This work has been supplemented by whole gene sequencing of 14 pe/ppe (including 5 pe_pgrs) genes in a cohort of 40 diverse and well defined clinical isolates covering all the main lineages of the M. tuberculosis phylogenetic tree. We show that nsSNP's in pe (excluding pgrs) and ppe genes are 3.0 and 3.3 times higher than in non-pe/ppe genes respectively and that numerous other mutation types are also present at a high frequency. It has previously been shown that non-pe/ppe M. tuberculosis genes display a remarkably low level of purifying selection. Here, we also show that compared to these genes those of the pe/ppe families show a further reduction of selection pressure that suggests neutral evolution. This is inconsistent with the positive selection pressure of "classical" antigenic variation. Finally, by analyzing such a large number of genes we were able to detect large differences in mutation type and frequency between both individual genes and gene sub-families. The high variation rates and absence of selective constraints provides valuable insights into potential pe/ppe function. Since pe/ppe proteins are highly antigenic and have been studied as potential vaccine components these results should also prove informative for aspects of M. tuberculosis vaccine design.
منابع مشابه
Polymorphisms in the PE35 and PPE68 antigens in Mycobacterium tuberculosis strains may affect strain virulence and reflect ongoing immune evasion.
Previous studies have demonstrated that the Pro‑Glu/Pro‑Pro‑Glu (PE/PPE) genes in strains of Mycobacterium tuberculosis exhibit high sequence variation and may be involved in antigenic variation and immune evasion. Region of Difference 1 (RD1), encoding genes from Rv3871 to Rv3879, was observed to be lost during the original derivation of Bacillus Calmette‑Guérin between 1908 and 1921. It has b...
متن کاملBiochemical characterization of PE_PGRS61 family protein of Mycobacterium tuberculosis H37Rv reveals the binding ability to fibronectin
Objective(s): The periodic binding of protein expressed by Mycobacterium tuberculosis H37Rv with the host cell receptor molecules i.e. fibronectin (Fn) is gaining significance because of its adhesive properties. The genome sequencing of M. tuberculosis H37Rv revealed that the proline-glutamic (PE) proteins contain polymorphic GC-rich repetitive sequences (PGRS) which have clinical importance i...
متن کاملThe PE-PPE Domain in Mycobacterium Reveals a Serine α/β Hydrolase Fold and Function: An In-Silico Analysis
The PE and PPE proteins first reported in the genome sequence of Mycobacterium tuberculosis strain H37Rv are now identified in all mycobacterial species. The PE-PPE domain (Pfam ID: PF08237) is a 225 amino acid residue conserved region located towards the C-terminus of some PE and PPE proteins and hypothetical proteins. Our in-silico sequence analysis revealed that this domain is present in all...
متن کاملFrequent homologous recombination events in Mycobacterium tuberculosis PE/PPE multigene families: potential role in antigenic variability.
The PE and PPE (PE/PPE) multigene families of Mycobacterium tuberculosis are particularly GC-rich and share extensive homologous repetitive sequences. We hypothesized that they may undergo homologous recombination events, a mechanism rarely described in the natural evolution of mycobacteria. To test our hypothesis, we developed a specific oligonucleotide-based microarray targeting nearly all of...
متن کاملComparative genomic and proteomic analyses of PE/PPE multigene family of Mycobacterium tuberculosis H37Rv and H37Ra reveal novel and interesting differences with implications in virulence
Tuberculosis, caused by Mycobacterium tuberculosis, remains a leading infectious disease taking one human life every 15 s globally. The two well-characterized strains H(37)Rv and H(37)Ra, derived from the same parental strain M. tuberculosis H(37), show dramatically different pathogenic phenotypes. PE/PPE gene family, comprising of 176 open reading frames and present exclusively in genus Mycoba...
متن کامل